Picture for Ranjay Krishna

Ranjay Krishna

JobBench: Aligning Agent Work With Human Will

Add code
May 25, 2026
Viaarxiv icon

Ablate-to-Validate: Are Vision-Language Models Really Using Continuous Thought Tokens?

Add code
May 20, 2026
Viaarxiv icon

RefDecoder: Enhancing Visual Generation with Conditional Video Decoding

Add code
May 14, 2026
Viaarxiv icon

VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition

Add code
May 05, 2026
Viaarxiv icon

MolmoAct2: Action Reasoning Models for Real-world Deployment

Add code
May 04, 2026
Viaarxiv icon

You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass

Add code
Apr 13, 2026
Viaarxiv icon

WildDet3D: Scaling Promptable 3D Detection in the Wild

Add code
Apr 09, 2026
Viaarxiv icon

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Add code
Apr 09, 2026
Viaarxiv icon

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

Add code
Mar 30, 2026
Viaarxiv icon

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Add code
Mar 27, 2026
Viaarxiv icon